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^5^^ BACKGROUND OF THE INVENTION 

. Field of the Invention 
5 This invention relates to the field of signal processing, and in particular to a 

programmable bridge that facilitates the processing of digital signal streams among a variety of 
functional units and one or more digital signal processors. 
2. Description of Related Art 

FIG. 1 A illustrates an example block diagram of a typical system 100 for receiving and 
10 processing a source input 101 for rendering via a user application 150. For example, the source 
input 101 may be a broadcast television signal, and the user application 150 may be the 
rendering of a television program on a display screen. The source input 101 may be an optical 
O signal from a DVD or CD player, and the user application 150 may be a video or analog 
ffi rendering device. The source input 101 may be a satellite or cellular telephone signal, and the 
^ 1 5 user application 150 may be a wireless telephone. 

An analog processor 110 filters and amplifies the analog source input 101, and a digital 
2 to analog converter 120 converts the filtered analog signal to a digital data stream. Optionally, if 

L the source input 101 is a digital signal, the analog processor 1 10 and analog to digital converter 

n 

ffl 120 can be bypassed. 

;^20 A channel decoder 130 receives the digital stream 129 from the converter 120, or fi-om 

y the source input 101 directly, and performs a variety of signal processing functions, generally 

r~i 

related to frequency and sample rate conversion, adaptive filtering, error correction, anti- 
aliasing, and the like. Depending upon the application, the channel decoder 130 may be referred 
to by a variety of altemative names, such as: radio receiver, baseband modulator, digital 

25 receiver, tuner, demodulator, and so on. To illustrate the complexity of a typical channel decoder 
130, an example decoding of a received digital stream 129 into an MPEG stream 139 is 
illustrated in FIG. IB. The received stream 129 is demodulated and equalized to provide a QAM 
symbol stream 133, using techniques common in the art. The QAM symbol stream 133 is 
decoded to produce an MPEG stream as the output 139 of the channel decoder 130. 

30 A source decoder 140 performs application specific functions on the decoded channel 

signal. For example, the decoded channel signal 139 may be an MPEG encoding of a video 
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stream, and the source decoder 140 performs the functions, such as inverse DCT, motion vector 
compensation, and the like, that are related to the conversion of an MPEG signal into a video 
stream that can be rendered on a display device via the user appHcation 150. If the input source 
101 is a telephone signal, the source decoder 140 performs the functions, such as GSM decoding, 
5 to provide a signal that can be rendered to the telephone handset via the user application 150. 

One of the difficulties in a traditional processing system such as illustrated in FIGs. 1 A 
and IB is the fact that standards are still evolving in most application fields. These standards 
typically evolve, or new standards emerge, to support enhanced or additional capabilities. A 
product that supports these additional capabilities will likely command a higher selling price, or 
10 a larger market share, than a 'prior-generation' device that was designed before these capabilities 
were available. The example channel decoder 130 in FIG. IB, for example, corresponds to an 
"ITU A" compatible channel decoder. This decoder 130 includes a sync detector 132 that 
O provides an input to the timing recover device 135 for synchronizing the incoming stream 129. 
^ The de-interleaver 133 provides a de-interlaced signal to the Reed-Solomon decoder 134. The 
^'15 Reed-Solomon decoder 134 also provides a synchronizing signal to the timing recovery device 
M= 135 for synchronizing the packet and frame rates. The de-randomizer 136 organizes the received 

and decoded stream into a coherent input for the formatter 137, which outputs an MPEG- 
^ formatted stream 139. An "ITU B" compatible channel decoder, however, performs the de- 

f s 

m randomizer 136 function immediately after the sync detector 132, and before the de-interleaver 
^20 133. Additionally, in "ITU B", an MPEG-specific timing recovery device (not illustrated in FIG. 
□ IB) is typically used to control the formatter 137, and the MPEG-specific timings are also 
provided to the timing recovery device 135 of FIG. IB. Thus, a change from an "ITU A" 
compatible device to an "ITU B" compatible device requires a somewhat substantial 
architectural change. 

25 Programmable digital signal processors provide the potential of allowing prior-generation 

devices to be reprogrammed to support the latest standard, and/or to provide additional or 
enhanced capabilities without requiring a structural design change. This potential, however, is 
limited to function that can be successfully embodied in a digital signal processor (DSP) in a 
cost effective manner. Some functions require processing speeds that cannot currently be 

30 provided by a general purpose DS; some functions are more efficiently performed in a special 
purpose, or application specific, device because of bandwidth limitations; and so on. 
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Structured design also provides the potential of minimizing the impact of a change of 
requirements for a product. Preferably, a design is structured to allow individual blocks, or 
modules, to be replaced to provide the required additional capabilities, without requiring a 
change to modules unrelated to the changed capability, and without a change to the architecture 
5 of the overall system. The prior generation module is replaced by the latest generation module, 
and the overall system regains its competitive standing in the marketplace. This potential, 
however, is limited to well-contained changes of requirements. Despite efforts to anticipate 
future changes and to provide maximum design flexibility, new requirements often cause a 
restructuring of the system architecture. In some instances, functions that had been unrelated 
10 become related to provide a particular function; new input signals may be required within 

modules that had not previously used these signals; efficiencies become realizable with a new 
architecture that had not been feasible in the old architecture; and so on. 

S BRIEF SUMMARY OF THE INVENTION 

"^15 It is an object of this invention to provide an architecture that facilitates a structural 

change to the architecture without requiring a substantial design change. It is a further object of 

^ this invention to provide a means for modifying the dataflow among functional units in a system 
without requiring a substantial design change. It is a further object of this invention to provide a 

m device that facilitates a programmable dataflow among functional units. It is a further object of 

—20 this invention to provide an architecture that facilitates a reallocation of function among special 

D and general purpose devices as technologies evolve. 

~ These objects and others are achieved by providing an architecture that includes a 

reconfigurable bridge for routing data among functional units. Register transfer units effect the 
routing of data among registers that are associated with each functional unit. Synchronous and 

25 asynchronous register transfers are supported, including interrupt signal generation for efficient 
digital signal processor support. A preferred embodiment of the reconfigurable bridge includes a 
plurality of reconfigurable datapath units that provide ancillary functions to facilitate the 
processing and pre-processing of data items as they are transferred among registers. A preferred 
embodiment of the invention also includes an instruction memory that contains instructions to 

30 effect the desired register transfers and ancillary operations. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
The invention is explained in further detail, and by way of example, with reference to the 
accompanying drawings wherein: 

FIGs. 1 A-IB illustrate an example block diagram of a prior art system for receiving and 
5 processing a source input for rendering via a user application. 

FIGs. 2A-2B illustrate an example transformation of a system dataflow in accordance with this 
invention. 

FIG. 3 illustrates an example block diagram of a channel decoder in accordance with this 
invention. 

10 FIG. 4 illustrates an example block diagram of a structurally programmable bridge in accordance 
with this invention. 

Throughout the drawings, the same reference numerals indicate similar or corresponding 
y features or functions. 

; 1 5 DETAILED DESCRIPTION OF THE INVENTION 

u 

^ FIGs. 2A-2B illustrate an example transformation of a system dataflow 200, 200* in 

accordance with this invention. The example dataflow 200 of FIG. 2 A shows an input INa 201 
L being processed by a variety of functional units F1-F4 210-240 to produce an output Q 209. The 
ffl functional units F1-F4 can represent any of a variety of functions. In the context of a digital 
m 20 television system such as illustrated in FIG. 1, the functions F1-F4 may represent blocks in the 
2 channel decoder that use feedback, via F4 to modify the characteristics of a filter Fl. In this 
example, block F2 may be a demodulator, and the block F3 may extract a parameter from a 
demodulated signal to provide the feedback to block F4. 

As technologies advance, and new features and capabilities become accepted, the 
25 dataflow 200 may need to be modified as illustrated in FIG. 2B to provide these features and 
capabilities as a modified output Q' 209*. As illustrated, the function unit F2 may be found to 
provide a more desirable output when its input is modulated by (multiplied by) a signal 25 1 that 
is a function F5 250 of its output 221. Similarly, the feedback via function unit F4 may be 
preferably combined (added to) another input INb 202. This modified dataflow is provided for 
30 illustration purposes as a modification that would typically require a substantial structural 

change to a conventional dataflow architecture. That is, for example, if the function blocks Fl- 
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F4 of FIG. 2 A were blocks of circuitry on a printed circuit board, the printed circuit board would 
need to be replaced by another board that also contained the function block 250, the multiplier 
260, and the adder 270. If a modular design is used, wherein each function block F1-F5 plugs 
into a common bus network, additional modules corresponding to the multiplier 260, and the 
adder 270 would need to be added to the system. If the modification also required a change to 
the timing or sequencing of the information flow among these blocks, the interface logic in one 
or more of the functional blocks F1-F4 would need to be modified accordingly. 

In accordance with this invention, a system is provided that allows for a modification of a 
dataflow such as illustrated in FIGs. 2A-2B without requiring a substantial change to the 
architecture of the system that supports these dataflows. 

For ease of understanding, a channel decoder is used herein as a paradigm for the 
principles of this invention. FIG. 3 illustrates an example block diagram of a channel decoder 
130' that includes a structurally reprogrammable bridge 350 that facilitates a reconfiguration of 
the dataflow among the components 310, 320 that form the channel decoder 130\ That is, for 
example, the functional units F1-F4 of FIG. 2 may correspond to the function units 320 or the 
DSP 310 of FIG. 3, and the bridge 350 provides the interconnection among the components 310, 
320 to effect the block diagram illustrated in FIG. 2. The bridge 350 in this example takes the 
output of a functional unit corresponding to Fl and provides it as an input to functional unit F2; 
it also takes the output of functional unit F2 and provides it to functional unit F3, and so on, to 
effect the desired flow of data among the components 310, 320 to provide the desired output Q, 
Q' from the source input INa, INb. 

Any of a variety of techniques can be applied the effect the communications among the 
components 310, 320 so that a change of requirements only requires a change to the bridge 350 
of the reconfigurable channel decoder 130'. In a preferred embodiment of this invention, the 
dataflow among components 310, 320 is effected via a register transfer system and protocol, as 
illustrated in FIG. 4. 

FIG. 4 illustrates an example block diagram of a structurally programmable bridge 350 in 
accordance with this invention. The programmable bridge 350 includes a plurality of interface 
registers 440, 450 for interfacing with the DSP 3 10 and function units 320. The DSP 3 10 is 
illustrated as a separate block from the function units 320 for ease of understanding, although 
conceptually it could be considered one of the function units 320. The function units 320 are 



70082 9 PATENT APPLICATION 



Page 5 of 12 



30 May 2000 





typically special purpose functional units that are optimized for their given task. As noted above, 
these function units 320 could include conventional signal processing blocks, such as a baseband 
modem, a tuner, an error corrector, a filter, and so on. As "sofhvare radios" become more 
prevalent, functional blocks corresponding to the functions developed to support software radio 
5 will become common. In accordance with this invention, each of these function units 320 is 
allocated one or more interface registers 450 for communicating data to and from other function 
units 320, to and from the DSP 310, and to and from the external environment as well. For 
efficiency and ease of data transfer, the function units 320 typically operate in a synchronous 
manner, and are often configured in a pipeline-processing manner. The DSP 3 10 is a 
10 conventional programmable digital signal processor, and similarly uses one or more interface 
registers 440 to communicate to and from the function units 320 and the external environment. 
As contrast to conventional function units 320, a DSP often operates effectively and efficiently 
P as an asynchronous, event-driven, device, and the bridge 350 includes synchronizing signals and 
m interrupt signals (not illustrated) for maintaining the appropriate timing relationships among the 
^15 components 310, 320. 

IS s 

H In a preferred embodiment of this invention, the programmable bridge 350 includes a 

^= plurality of register transfer units 420 that each effect the transfer of data among interface 

L, registers 440, 450. In accordance with one aspect of this invention, the register transfer units 420 

Oj are controlled via instructions stored in an instruction register 410. The instruction is of the 



Movel Rs Rd, 

where Rs is the source register from which the data is transferred, and Rd is the destination 
register to which the data is transferred. External inputs and outputs, such as INa, INb, Q, and Q' 
25 in FIGs. 2A-2B are also treated as registers. The Movel instruction also generates an interrupt 
signal to the component 310, 320 corresponding to the destination register, typically the DSP 
310. For example, a program to effect the structure of FIG. 2 A could be written as: 



m20 



general form: 



Move Rs Rd, or 



Move INa Fl.inl, Move F4.outl Fl.in2; 

MoveFl.outl F2.ini; 

Move F2.outl F3.ini; 

Move F3.outl F4.ini, Move F3.out2 Q. 



(1) 
(2) 
(3) 
(4) 
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The program step at (1) provides the two inputs, INa 201 and the output of function unit F4 240, 
to the function unit Fl; the program step at (2) provides the output of the function unit Fl to the 
input to function unit F2; and so on. In a preferred embodiment, instructions on the same line are 
executed within a single time period, such as a DSP clock period, and instructions on the next 
5 line are executed at a 'nexf time period. The set of the four lines above is executed at each major 
time period, such as a data period. Other conventions for progranrmiing languages, or design 
languages, common in the art, may also be used. Note that by controlling the flow of data among 
function units via a programmable register transfer, the system architecture can be changed 
without a physical change of the system. Note also that this change of system architecture can 
10 include a replacement of a special purpose functional unit by including its function in the 
programmable DSP 310, thereby reducing system cost as programmable DSPs become 
increasingly powerful. In like manner, if advancing technologies allow function units to be 

O combined to reduce costs, the reconfiguration of the system to support such changes can also be 

^ supported via a programming change. 

'^'15 In accordance with another aspect of this invention, the programmable bridge 350 also 

1=^ includes reconfigurable datapath units 430. These datapath units 430 are structured to allow a 

transformation of the data as it is being transferred among the registers 440, 450. In a preferred 
^ embodiment, the datapath units 430 are configurable to provide such functions as addition, 
ffl subtraction, multiplication, and division. Other functions may also be provided. These functions 
L.20 are effected via a command: 
B Config RDUn mode, 

where RDUn is an indentifier of one of the reconfigurable datapath units, and mode is the 

function that is to be executed. The following is an example program that effects the structure of 

FIG. 2B: 

25 Move INa Fl.inl, Config RDUl add, Move INb RDUl.inl, Move F4.outl RDUl.in2, 



Move RDUl.out Fl.in2; (5) 
Config RDU2 mpy. Move Fl.outl RDU2.ini, Move F5.outl RDU2.in2, Move 

RDU2.outl F2.ini; (6) 
Move F2.outl F5.ini, Move F2.outl F3.ini; (7) 
30 Move F3.outl F4.ini, Move F3.out2 Q*. (8) 
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The "Config RDUl add" statement in the program step at (5) configures an RDU 430 in FIG. 4 
to effect an addition function, corresponding to the adder 270 of FIG. 2B. The "Move INb 
RDUl.inl" statement effects a transfer of data from the new input, INb 202, to a first input of 
this RDU 270; "Move F4.outl RDUl.in2" statement effects a transfer of the output of function 
5 unit F4 240 to the other input of this RDU 270; and, "Move RDU Lout FLin2" moves the result 
of the addition at this RDU 270 to the second input of function unit Fl 210. In Hke manner, a 
second RDU is configured as a multipUer via the "Config RDU2 mpy" statement at (6), 
corresponding to the multiplier 260 of FIG. 2B, and the other statements effect the routing of the 
inputs and output of this RDU 260. 
10 As can be seen, by providing a reconfigurable bridge 350, with reconfigurable datapath 

units 430, substantial changes to the system architecture can be effected, via a change to the 
configuration of the bridge 350 and datapath units 430, rather than a change to the underlying 
physical structure of the system. 



^ transfers, where Nl is the total number of inputs and Ml is the total number of outputs being 

interconnected via the register transfers. Although Nl and Ml, and therefore K, can be chosen to 
ffl correspond to the total possible number of inputs N and total possible outputs M from the 
ffilO datapath units 430 and interface registers 440, 450, Nl and Ml are preferably chosen based on 
y heuristics, based on estimates of peak demand, to reduce the cost of the bridge 350. In like 
manner, the number of reconfigurable datapath units 430 is based on estimates of future 
requirements. Typically, a system in a new technology will have a higher proportion of register 
transfer units 420 and datapath units 430 than one that embodies a fairly stable technology, 
25 because of the higher likelihood of change in a new technology. In a preferred embodiment, the 
instructions used to configure the register transfer units comprise Ml*log2M + NPlogjN bits to 
describe the interconnect. 

The foregoing merely illustrates the principles of the invention. It will thus be 
appreciated that those skilled in the art will be able to devise various arrangements which, 
30 although not explicitly described or shown herein, embody the principles of the invention and 
are thus within the spirit and scope of the following claims. 
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The ability of the system to support current and future requirements is dependent upon 
the number of register transfer units in the register transfer units block 420. A set of K 
nonblocking register transfer units will support up to NlxMl <= K simultaneous register 



